SOLVING THE 100 SWISS FRANCS PROBLEM 



SHUHONG GAO, GUANGRAN JIANG, AND MINGFU ZHU 

Abstract. In 2005 Sturmfels offered 100 Swiss francs for a proof of a conjecture concerning a special case of maximum likelihood estimation for a latent class model. This paper presents a positive solution to the conjecture.



1. The conjecture and its statistics background 

Sturmfels [7] proposed the following problem: Maximize the likelihood function

\[ L(P) = \prod_{i=1}^{4} p_{ii}^{4} \times \prod_{i \neq j} p_{ij}^{2} \tag{1} \]

over the set of all $4 \times 4$ matrices $P = (p_{ij})$ whose entries are nonnegative and sum to 1 and whose rank is at most two. Based on numerical experiments by employing an EM algorithm, Sturmfels [5,7] conjectured that the matrix

\[ \frac{1}{40} \begin{pmatrix} 3 & 3 & 2 & 2 \\ 3 & 3 & 2 & 2 \\ 2 & 2 & 3 & 3 \\ 2 & 2 & 3 & 3 \end{pmatrix} \]

is a global maximum of $L(P)$. He offered 100 Swiss francs for a rigorous proof in a postgraduate course held at ETH Zürich in 2005.
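
As a quick sanity check on the statement (an illustrative sketch, not part of the original argument), the following Python snippet evaluates the likelihood (1) exactly with rational arithmetic at the conjectured matrix and at the uniform matrix with all entries 1/16. The helper name `likelihood` and the variable names are our own.

```python
from fractions import Fraction

def likelihood(P):
    """Evaluate L(P) = prod_i p_ii^4 * prod_{i != j} p_ij^2 exactly."""
    value = Fraction(1)
    for i in range(4):
        for j in range(4):
            value *= P[i][j] ** (4 if i == j else 2)
    return value

# Conjectured maximizer: (1/40) * [[3,3,2,2],[3,3,2,2],[2,2,3,3],[2,2,3,3]]
pattern = [[3, 3, 2, 2], [3, 3, 2, 2], [2, 2, 3, 3], [2, 2, 3, 3]]
conjectured = [[Fraction(v, 40) for v in row] for row in pattern]

# Uniform rank-one matrix with all entries 1/16, used only for comparison
uniform = [[Fraction(1, 16)] * 4 for _ in range(4)]

total = sum(sum(row) for row in conjectured)
ratio = likelihood(conjectured) / likelihood(uniform)
print("entries sum to", total)
print("L(conjectured) / L(uniform) =", float(ratio))
```

The conjectured matrix has two pairs of identical rows, so its rank is at most two, and its likelihood exceeds that of the uniform matrix.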

Partial results were given in the paper [2], where the general statistical background for this problem is also presented. This problem is a special case of the maximum likelihood estimation for a latent class model. More precisely, by following [2], let $(X_1, \dots, X_d)$ be a discrete multivariate random vector where each $X_j$ takes values in a finite state set $S_j = \{1, \dots, s_j\}$. Let $\Omega = \times_{j=1}^{d} S_j$ be the sample space. For each $(x_1, \dots, x_d) \in \Omega$, the joint probability mass function of $(X_1, \dots, X_d)$ is denoted as

\[ p(x_1, \dots, x_d) = P\{(X_1, \dots, X_d) = (x_1, \dots, x_d)\}. \]

The variables $X_1, \dots, X_d$ are in general not mutually independent. By introducing an unobservable variable $H$ defined on the set $[r] = \{1, \dots, r\}$, the variables $X_1, \dots, X_d$ become mutually independent conditional on $H$. The joint probability mass function in the newly formed model is

\[ p(x_1, \dots, x_d, h) = P\{(X_1, \dots, X_d, H) = (x_1, \dots, x_d, h)\} = p(x_1|h) \cdots p(x_d|h)\, \lambda_h \]

where $\lambda_h$ is the marginal probability $P\{H = h\}$ and $p(x_j|h)$ is the conditional probability $P\{X_j = x_j \mid H = h\}$. We denote this new $r$-class mixture model by $\mathcal{H}$. The marginal distribution of $(X_1, \dots, X_d)$ in $\mathcal{H}$ is given by the probability mass function (which is also called accounting equations [3])

\[ p(x_1, \dots, x_d) = \sum_{h \in [r]} p(x_1, \dots, x_d, h) = \sum_{h \in [r]} p(x_1|h) \cdots p(x_d|h)\, \lambda_h. \]

In practice, a collection of samples from $\Omega$ is observed. For each $(x_1, \dots, x_d)$, let $n(x_1, \dots, x_d) \in \mathbb{N}$ be the number of observed occurrences of $(x_1, \dots, x_d)$ in the samples, while the parameters $p(x_1|h), \dots, p(x_d|h)$, $\lambda_h$ and $p(x_1, \dots, x_d)$ are unknown. The maximum likelihood estimation problem is to find the model parameters that best explain the observed data, that is, to determine the global maxima of the likelihood function

\[ L(\mathcal{H}) = \prod_{(x_1, \dots, x_d) \in \Omega} p(x_1, \dots, x_d)^{n(x_1, \dots, x_d)}. \]

Since each $p(x_1, \dots, x_d)$ is nonnegative, it is equivalent but more convenient to use the log-likelihood function

\[ l(\mathcal{H}) = \sum_{(x_1, \dots, x_d) \in \Omega} n(x_1, \dots, x_d) \ln p(x_1, \dots, x_d), \tag{2} \]

where we define $\ln(0) = -\infty$. Finding the maxima of (2) is difficult and remains infeasible for current symbolic software [1,4]. Only some special cases can be handled: small models or highly symmetric tables. The 100 Swiss francs problem is the special case of $\mathcal{H}$ when $d = 2$, $S_1 = S_2 = \{A, C, G, T\}$, $s_1 = s_2 = 4$ and $r = 2$. It is related to a DNA sequence alignment problem as described in [5]. In that example, the contingency table for the observed counts of ordered pairs of nucleotides (i.e. AA, AC, AG, AT, CA, CC, ...) has the count 4 in every diagonal cell and 2 in every off-diagonal cell, matching the exponents in (1). So the likelihood function (2) in this example is exactly (1).

Even for this simple case, the problem is highly non-trivial and remains surprisingly difficult. A global maximum must exist, since the likelihood is continuous and the parameter region is closed and bounded. By using the EM algorithm or the Newton-Raphson method and starting from suitable points, one can find some local maxima of the likelihood function. However, such methods do not guarantee that a maximum found is global.
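
To make the remark about the EM algorithm concrete, here is a minimal sketch of EM for the two-class mixture model on a $4 \times 4$ table. It assumes the count table with 4 on the diagonal and 2 elsewhere (the exponents of (1)); the names `COUNTS`, `joint`, `em` and the random initialization are our own choices, not taken from [5] or [7].

```python
import math
import random

# Assumed count table: 4 on the diagonal, 2 elsewhere (exponents of (1))
COUNTS = [[4, 2, 2, 2], [2, 4, 2, 2], [2, 2, 4, 2], [2, 2, 2, 4]]

def normalize(v):
    s = sum(v)
    return [x / s for x in v]

def joint(lam, alpha, beta):
    """Model probabilities p_ij = sum_h lam[h] * alpha[h][i] * beta[h][j]."""
    return [[sum(lam[h] * alpha[h][i] * beta[h][j] for h in range(2))
             for j in range(4)] for i in range(4)]

def log_likelihood(P):
    return sum(COUNTS[i][j] * math.log(P[i][j])
               for i in range(4) for j in range(4))

def em(steps=200, seed=0):
    rng = random.Random(seed)
    lam = normalize([rng.random() + 0.1 for _ in range(2)])
    alpha = [normalize([rng.random() + 0.1 for _ in range(4)]) for _ in range(2)]
    beta = [normalize([rng.random() + 0.1 for _ in range(4)]) for _ in range(2)]
    history = []
    for _ in range(steps):
        P = joint(lam, alpha, beta)
        history.append(log_likelihood(P))
        # E-step: expected number of observations (i, j) assigned to class h
        w = [[[COUNTS[i][j] * lam[h] * alpha[h][i] * beta[h][j] / P[i][j]
               for j in range(4)] for i in range(4)] for h in range(2)]
        # M-step: re-estimate class weights and conditional distributions
        class_total = [sum(sum(row) for row in w[h]) for h in range(2)]
        lam = normalize(class_total)
        alpha = [[sum(w[h][i]) / class_total[h] for i in range(4)]
                 for h in range(2)]
        beta = [[sum(w[h][i][j] for i in range(4)) / class_total[h]
                 for j in range(4)] for h in range(2)]
    return joint(lam, alpha, beta), history

P_hat, history = em()
print("log-likelihood after EM:", log_likelihood(P_hat))
```

Each EM step cannot decrease the log-likelihood, but the limit depends on the starting point, which is why a rigorous argument is needed for the global claim.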

Our paper is organized as follows. We first derive some general properties of optimal solutions in Section 2, and then prove the conjecture in Section 3. In Section 4 we comment on why the Gröbner basis technique does not work efficiently, and suggest several new conjectures for more general cases.

2. General Properties 

We focus on general $n \times n$ matrices $P = (p_{ij})$ in this section. For convenience we scale each entry of $P$ by $n^2$ so the entries sum to $n^2$, and take the square root of the original likelihood function. So we may assume that

\[ L(P) = \prod_{i=1}^{n} p_{ii}^{2} \times \prod_{i \neq j} p_{ij}. \tag{3} \]

The problem is

Maximize: $L(P)$
Subject to: $\operatorname{rank}(P) \le 2$, $\sum_{1 \le i,j \le n} p_{ij} = n^2$, and $p_{ij} \ge 0$, $1 \le i,j \le n$.

Suppose $P = (p_{ij})_{n \times n}$ is a global maximum of $L(P)$. Apparently $P$ cannot be the following $n \times n$ matrix

\[ J = \begin{pmatrix} 1 & \cdots & 1 \\ \vdots & \ddots & \vdots \\ 1 & \cdots & 1 \end{pmatrix}, \]

as many other matrices (for instance, the matrix obtained by setting $s = 2$, $t = 1$ in the matrix of Conjecture 12 and then scaling by $n^2$) have a larger likelihood value.

Since the function (3) is continuous in the $p_{ij}$'s, if one of the entries of $P$ approaches 0, the product has to approach 0 too, as the other entries are bounded by $n^2$. Hence the optimal solutions must occur at interior points and we need not worry about the boundary where some $p_{ij} = 0$.

In the following discussion, we assume that $P \neq J$ and all its entries are positive. We show that $P$ must have certain symmetry properties.

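Lemma 1. For an optimal solution $P$, its row sums and column sums must all equal $n$.

Proof. Let $s_i = \sum_{j=1}^{n} p_{ij}$. Then $\sum_{i=1}^{n} s_i = n^2$ and $\prod_i s_i \le n^n$, with equality if and only if $s_i = n$ for all $i$. Let $\tilde p_{ij} = \frac{n}{s_i} p_{ij}$ and $\tilde P = (\tilde p_{ij})_{n \times n}$. Then $\operatorname{rank}(\tilde P) = \operatorname{rank}(P)$ and $\sum \tilde p_{ij} = n^2$. However, since the exponents in each row of (3) add up to $n + 1$,

\[ L(\tilde P) = \prod_{i=1}^{n} \left( \frac{n}{s_i} \right)^{n+1} L(P) = \left( \frac{n^n}{\prod_i s_i} \right)^{n+1} L(P) \ge L(P), \]

with equality if and only if $s_i = n$ for all $i$. Since $P$ is a global maximum, $L(\tilde P) \le L(P)$. Therefore each row sum equals $n$. Similarly, each column sum equals $n$ as well. □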
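
The row-rescaling step in this proof can be tried on a concrete matrix. The sketch below uses a hypothetical rank-two example of our own choosing (the vectors `a` and `b` are arbitrary), rescales each row to sum to $n$, and compares the likelihood (3) before and after with exact rational arithmetic.

```python
from fractions import Fraction

def likelihood(P):
    """L(P) = prod_i p_ii^2 * prod_{i != j} p_ij, as in (3)."""
    value = Fraction(1)
    for i, row in enumerate(P):
        for j, x in enumerate(row):
            value *= x ** (2 if i == j else 1)
    return value

def rescale_rows(P):
    """Multiply row i by n / s_i, where s_i is the i-th row sum."""
    n = len(P)
    return [[Fraction(n) * x / sum(row) for x in row] for row in P]

# Hypothetical positive rank-two matrix J + b a^T with unequal row sums
n = 4
a = [Fraction(1, 2), Fraction(1, 3), Fraction(-1, 4), Fraction(-1, 5)]
b = [Fraction(1, 2), Fraction(1, 4), Fraction(-1, 3), Fraction(-1, 2)]
raw = [[1 + a[j] * b[i] for j in range(n)] for i in range(n)]

# Scale so that the entries sum to n^2, as required by the constraint
scale = Fraction(n * n) / sum(sum(row) for row in raw)
P = [[scale * x for x in row] for row in raw]
P_tilde = rescale_rows(P)

print("row sums before:", [float(sum(row)) for row in P])
print("row sums after: ", [float(sum(row)) for row in P_tilde])
print("likelihood ratio:", float(likelihood(P_tilde) / likelihood(P)))
```

Rescaling rows does not change the rank, keeps the total at $n^2$, and by the arithmetic-geometric mean inequality can only increase the likelihood.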

We shall express $P$ in a form that involves fewer variables and has no rank constraint. Since $P$ has rank at most two, by the singular value decomposition theorem, there are column vectors $u_1, u_2, v_1$ and $v_2$ of length $n$ such that

\[ P = \sigma_1 u_1 v_1^t + \sigma_2 u_2 v_2^t \]

for some nonnegative numbers $\sigma_1$ and $\sigma_2$. By Lemma 1, $P$ has equal row and column sums, so $P$ has the vectors $(1, 1, \dots, 1)$ and $(1, 1, \dots, 1)^t$ as its left and right eigenvectors, both with eigenvalue $n$. Hence we may assume that $\sigma_1 = 1$ and $u_1 = v_1 = (1, 1, \dots, 1)^t$. Let $v_2 = (a_1, a_2, \dots, a_n)^t$ and $\sigma_2 u_2 = (b_1, b_2, \dots, b_n)^t$. Then $P$ has the form

\[ P = J + \begin{pmatrix} b_1 \\ \vdots \\ b_n \end{pmatrix} (a_1, a_2, \dots, a_n) = \begin{pmatrix} 1 + a_1 b_1 & \cdots & 1 + a_n b_1 \\ \vdots & 1 + a_i b_j & \vdots \\ 1 + a_1 b_n & \cdots & 1 + a_n b_n \end{pmatrix}. \]

In this form, $P$ has rank at most two. Also, the condition $\sum p_{ij} = n^2$ becomes

\[ \sum_{i=1}^{n} a_i \sum_{i=1}^{n} b_i = 0. \tag{4} \]

We have transformed the original problem to the following optimization problem:

Maximize: $l(P) = 2 \sum_{i=1}^{n} \ln(1 + a_i b_i) + \sum_{i \neq j} \ln(1 + a_i b_j)$
Subject to: equation (4) and $1 + a_i b_j > 0$, $1 \le i, j \le n$.

The Lagrangian function would be

\[ \Lambda(P, \lambda) = l(P) + \lambda \sum_{i=1}^{n} a_i \sum_{i=1}^{n} b_i \]

where $\lambda \in \mathbb{R}$. Any local extremum must satisfy

\[ \frac{\partial \Lambda}{\partial a_i} = \sum_{j=1}^{n} \frac{b_j}{1 + a_i b_j} + \frac{b_i}{1 + a_i b_i} + \lambda \sum_{j=1}^{n} b_j = 0, \quad 1 \le i \le n, \tag{5} \]

and

\[ \frac{\partial \Lambda}{\partial b_j} = \sum_{i=1}^{n} \frac{a_i}{1 + a_i b_j} + \frac{a_j}{1 + a_j b_j} + \lambda \sum_{i=1}^{n} a_i = 0, \quad 1 \le j \le n. \tag{6} \]

By Lemma 1, for an optimal solution $P$, its row sums and column sums must all be equal to $n$. This means that

\[ \sum_{i=1}^{n} a_i = 0, \tag{7} \]

and

\[ \sum_{i=1}^{n} b_i = 0. \tag{8} \]

Plugging (7) and (8) into (5) and (6) respectively, we obtain the following lemma.

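Lemma 2. A global maximum $P$ must satisfy

\[ \sum_{j=1}^{n} \frac{b_j}{1 + a_i b_j} + \frac{b_i}{1 + a_i b_i} = 0, \quad 1 \le i \le n, \tag{9} \]

and

\[ \sum_{i=1}^{n} \frac{a_i}{1 + a_i b_j} + \frac{a_j}{1 + a_j b_j} = 0, \quad 1 \le j \le n. \tag{10} \]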

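Doing some simple algebra yields

Corollary 3. An optimal solution must satisfy

\[ \sum_{j=1}^{n} \frac{1}{1 + a_i b_j} + \frac{1}{1 + a_i b_i} = n + 1, \quad 1 \le i \le n, \tag{11} \]

and

\[ \sum_{i=1}^{n} \frac{1}{1 + a_i b_j} + \frac{1}{1 + a_j b_j} = n + 1, \quad 1 \le j \le n. \tag{12} \]

Proof. Multiply (9) by $a_i$ and add $\sum_{j=1}^{n} \frac{1}{1 + a_i b_j} + \frac{1}{1 + a_i b_i}$ to both sides. Since $\frac{a_i b_j}{1 + a_i b_j} + \frac{1}{1 + a_i b_j} = 1$, the left-hand side becomes $n + 1$, which gives (11). Equation (12) follows from (10) in the same way. □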
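
Equations (11) and (12) are easy to evaluate numerically. The following sketch computes their residuals for given vectors; the function name `corollary3_residuals` is ours, and the test point $a = b = (c, c, -c, -c)$ with $c^2 = 1/5$ is the one that corresponds to the conjectured matrix in Section 3.

```python
import math

def corollary3_residuals(a, b):
    """Left side minus (n + 1) for each equation in (11) and in (12)."""
    n = len(a)
    rows = [sum(1 / (1 + a[i] * b[j]) for j in range(n))
            + 1 / (1 + a[i] * b[i]) - (n + 1) for i in range(n)]
    cols = [sum(1 / (1 + a[i] * b[j]) for i in range(n))
            + 1 / (1 + a[j] * b[j]) - (n + 1) for j in range(n)]
    return rows, cols

c = 1 / math.sqrt(5)
a = [c, c, -c, -c]
b = [c, c, -c, -c]
rows, cols = corollary3_residuals(a, b)
print("largest residual:", max(abs(r) for r in rows + cols))
```

A point that is not critical, such as $c = 0.3$ in the same pattern, leaves a clearly nonzero residual.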



The $2n$ equations derived by clearing denominators of the equations in Lemma 2 or Corollary 3, along with equations (7) and (8), form a system $S$ of $2n + 2$ polynomial equations in $2n$ unknowns, whose solution set contains all global maxima. From a computational viewpoint we may solve $S$ by brute force, for example with the Gröbner basis method, and then compare the local extrema to find the global maximum. However, with the computers available to us, current symbolic computation software such as Magma, Maple and Singular is unable to compute a Gröbner basis for $n = 4$.

Our strategy below is to prove that $P$ has high symmetry. First, the $a_i$'s and $b_i$'s are in the same order: if $a_i > a_j > 0$, then $b_i > b_j > 0$ correspondingly (Lemmas 4 and 5). For the case $n = 4$, once we force $a_1 = b_1$ by scaling, we can eventually prove $a_i = b_i$ for all other $i$'s (Lemmas 7 and 9). With four $a_i$'s remaining, we prove that the $a_i$'s with the same sign must be identical. Finally one can solve the system by hand. Note that results similar to Lemmas 4 and 5 were derived in [2], but our approach is fairly different.

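Lemma 4. For every $i$,

(1) $a_i = 0$ if and only if $b_i = 0$, and

(2) $a_i > 0$ if and only if $b_i > 0$.

Proof. (1) Plugging $a_i = 0$ into equation (9), we have $\sum_{j} b_j + b_i = 0$, thus $b_i = 0$ by (8). Similarly, if $b_i = 0$ then $a_i = 0$.

(2) Note that $g(x) = \frac{1}{x}$ is concave up in $(0, \infty)$. By Jensen's Inequality,

\[ \frac{1}{n} \sum_{j=1}^{n} \frac{1}{1 + a_i b_j} \ge \frac{1}{\frac{1}{n} \sum_{j=1}^{n} (1 + a_i b_j)} = 1, \]

where the last equality uses (8). That is,

\[ \sum_{j=1}^{n} \frac{1}{1 + a_i b_j} \ge n. \]

Comparing with equation (11), we get

\[ \frac{1}{1 + a_i b_i} \le 1, \]

so $a_i b_i \ge 0$. Together with part (1), we conclude that $a_i > 0$ if and only if $b_i > 0$. □

Lemma 5. For $i$ and $j$,

(1) $a_i = a_j$ if and only if $b_i = b_j$, and

(2) $a_i > a_j$ if and only if $b_i > b_j$.

Proof. (1) Suppose $b_i = b_j$. Then, by (10),

\[ \sum_{k=1}^{n} \frac{a_k}{1 + a_k b_i} + \frac{a_i}{1 + a_i b_i} = 0 \quad \text{and} \quad \sum_{k=1}^{n} \frac{a_k}{1 + a_k b_j} + \frac{a_j}{1 + a_j b_j} = 0. \]

Then $\frac{a_i}{1 + a_i b_i} = \frac{a_j}{1 + a_j b_j}$, so $a_i = a_j$. Similarly, using (9), we have that if $a_i = a_j$ then $b_i = b_j$.

(2) Switch $b_i, b_j$ in $P$ to form a new matrix $\tilde P$. Then we should have $L(P) \ge L(\tilde P)$. Note that

\[ \begin{aligned} L(P) - L(\tilde P) &= C_1 \cdot \big( (1 + a_i b_i)^2 (1 + a_i b_j)(1 + a_j b_i)(1 + a_j b_j)^2 \\ &\qquad\quad - (1 + a_i b_j)^2 (1 + a_i b_i)(1 + a_j b_j)(1 + a_j b_i)^2 \big) \\ &= C_2 \cdot \big( (1 + a_i b_i)(1 + a_j b_j) - (1 + a_i b_j)(1 + a_j b_i) \big) \\ &= C_2 \cdot (a_i b_i + a_j b_j - a_i b_j - a_j b_i) \\ &= C_2 \cdot (a_i - a_j)(b_i - b_j) \end{aligned} \]

where $C_1, C_2$ are products of some entries of $P$, so $C_1, C_2$ are positive. Thus $(a_i - a_j)(b_i - b_j) \ge 0$. Since $a_i = a_j$ if and only if $b_i = b_j$ by part (1), we conclude that $a_i > a_j$ if and only if $b_i > b_j$. □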
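
The swap argument in part (2) can be checked on a small example. The sketch below uses arbitrary rational vectors of our own choosing in which $a$ and $b$ are deliberately ordered in opposite ways at positions 1 and 2, so swapping $b_1$ and $b_2$ should increase the likelihood; the function name `L` and all data are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

def L(a, b):
    """Likelihood (3) for the matrix with entries p_ij = 1 + a_j * b_i."""
    n = len(a)
    value = Fraction(1)
    for i, j in product(range(n), repeat=2):
        value *= (1 + a[j] * b[i]) ** (2 if i == j else 1)
    return value

# Illustrative data: both vectors sum to zero and all entries stay positive
a = [Fraction(1, 2), Fraction(1, 5), Fraction(-1, 5), Fraction(-1, 2)]
b = [Fraction(1, 3), Fraction(1, 2), Fraction(-1, 2), Fraction(-1, 3)]

i, j = 0, 1
b_swapped = b[:]
b_swapped[i], b_swapped[j] = b[j], b[i]

difference = L(a, b) - L(a, b_swapped)
order_term = (a[i] - a[j]) * (b[i] - b[j])

# The algebraic identity used in the last two lines of the displayed chain
identity_lhs = ((1 + a[i] * b[i]) * (1 + a[j] * b[j])
                - (1 + a[i] * b[j]) * (1 + a[j] * b[i]))

print("L(P) - L(P~) has the sign of", "+" if difference > 0 else "-")
print("(a_i - a_j)(b_i - b_j) =", order_term)
```

Here the order term is negative, so the original matrix cannot be optimal, which is exactly the contradiction the proof exploits.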

3. Proof for the conjecture 

We complete the proof for the conjecture in this section. From now on we focus on the case $n = 4$. By Lemma 5 we can always assume $a_1 \ge a_2 \ge a_3 \ge a_4$ and $b_1 \ge b_2 \ge b_3 \ge b_4$. We know $a_1 \neq 0$, otherwise $P = J$. So we replace $(a_1, a_2, a_3, a_4)$ in $P$ by $\sqrt{b_1/a_1}\,(a_1, a_2, a_3, a_4)$ and $(b_1, b_2, b_3, b_4)^t$ by $\sqrt{a_1/b_1}\,(b_1, b_2, b_3, b_4)^t$. It turns out that $1 + \sqrt{b_1/a_1}\, a_i \cdot \sqrt{a_1/b_1}\, b_j = 1 + a_i b_j$ for any $i$ and $j$, so we may always assume $a_1 = b_1$. Thus $P$ can be expressed in the form

\[ P = \begin{pmatrix} 1 + a_1^2 & 1 + a_2 a_1 & 1 + a_3 a_1 & 1 + a_4 a_1 \\ 1 + a_1 b_2 & 1 + a_2 b_2 & 1 + a_3 b_2 & 1 + a_4 b_2 \\ 1 + a_1 b_3 & 1 + a_2 b_3 & 1 + a_3 b_3 & 1 + a_4 b_3 \\ 1 + a_1 b_4 & 1 + a_2 b_4 & 1 + a_3 b_4 & 1 + a_4 b_4 \end{pmatrix}. \tag{13} \]

If $a_2 < 0$, we then replace $(a_1, a_2, a_3, a_4)$ in $P$ by $(-a_4, -a_3, -a_2, -a_1)$ and $(b_1, b_2, b_3, b_4)^t$ by $(-b_4, -b_3, -b_2, -b_1)^t$. The new matrix with $-a_4 \ge -a_3 > 0$ has the same likelihood function as $P$. Thus we may assume $a_1 \ge a_2 \ge 0$. Without loss of generality, we may make the following assumption.

Assumption 6. We can always assume the following:

(1) $a_1 \ge a_2 \ge a_3 \ge a_4$ and $b_1 \ge b_2 \ge b_3 \ge b_4$,

(2) $a_1 = b_1 > 0$, and

(3) $a_1 \ge a_2 \ge 0$.

The results in the rest of this section are all based on Assumption 6. Our first goal is to prove $a_2 = b_2$.

Lemma 7. $a_2 = b_2$.

Proof. If one of $a_2, b_2$ is 0, then $a_2 = b_2 = 0$ by Lemma 4. We assume that both $a_2, b_2$ are nonzero.

Apply Corollary 3 to the first row of matrix (13). We have

\[ \frac{2}{1 + a_1^2} + \frac{1}{1 + a_2 a_1} + \frac{1}{1 + a_3 a_1} + \frac{1}{1 + a_4 a_1} = 5. \]

Also

\[ a_1^2 + a_2 a_1 + a_3 a_1 + a_4 a_1 = 0. \]

From the above two equations we get

\[ a_3 a_1 \cdot a_4 a_1 = f_1(a_1^2, a_1 a_2) \tag{14} \]

where $f_1$ is a bivariate function in $x, y$ defined as

\[ f_1(x, y) = \frac{2 - x - y}{5 - \frac{2}{1 + x} - \frac{1}{1 + y}} + x + y - 1. \]

Similarly, apply Corollary 3 to the second row of matrix (13). We get

\[ \frac{1}{1 + a_1 b_2} + \frac{2}{1 + a_2 b_2} + \frac{1}{1 + a_3 b_2} + \frac{1}{1 + a_4 b_2} = 5. \]

Along with

\[ a_1 b_2 + a_2 b_2 + a_3 b_2 + a_4 b_2 = 0, \]

we get

\[ a_3 b_2 \cdot a_4 b_2 = f_1(a_2 b_2, a_1 b_2). \tag{15} \]

Since $a_1, b_2$ are nonzero, we combine equations (14) and (15) to get

\[ \frac{f_1(a_1^2, a_1 a_2)}{a_1^2} = \frac{f_1(a_2 b_2, a_1 b_2)}{b_2^2}. \tag{16} \]

Normalizing (16) we can derive a trivariate polynomial equation, say

\[ f_2(a_1, a_2, b_2) = 0. \tag{17} \]

Symmetrically, applying Corollary 3 to the first column and the second column of (13), we get

\[ \frac{f_1(a_1^2, a_1 b_2)}{a_1^2} = \frac{f_1(a_2 b_2, a_1 a_2)}{a_2^2}. \tag{18} \]

One can see that equation (18) is obtainable by switching $a_2$ with $b_2$ in equation (16). Thus we have

\[ f_2(a_1, b_2, a_2) = 0. \tag{19} \]

Subtracting (19) from (17) yields

\[ f_2(a_1, a_2, b_2) - f_2(a_1, b_2, a_2) = 0. \]

Since we only switched $a_2$ and $b_2$ in the polynomial $f_2$, there must be a factor $a_2 - b_2$ of $f_2(a_1, a_2, b_2) - f_2(a_1, b_2, a_2)$, say

\[ (a_2 - b_2)\, f_3(a_1, a_2, b_2) = 0, \tag{20} \]

where 

fsiai, 02, 62) = (200^62 + 150^62 + Sajbl + 2aib2 - ^bDaj 

+ {3a% + 15albl + 2al + 10alb2 + 2aibl - 3ai - 62)^2 

- Aat + 2alb2 - a\- 80162 - 2. 

Thus $a_2 = b_2$ if $f_3(a_1, a_2, b_2) \neq 0$. This is true because we have some bounds for $a_1^2$, $a_1 a_2$, $a_1 b_2$ as presented in Lemma 8 below, which can be applied to get

/3(ai, 02, 62) = {2^a\b\ + 15a?62 + 'ialbl + 2ai62 - ^b\)a\ 

+ (80162 + 15ai^6^ + 2a\ + 10a^62 + 2ai6^ - 8ai - 62)02 

- 4a^ + 20^62 -a\- 80162 - 2 

20 15 3 9 9 2 9 9 

< ^ + ^ + 4«2^2 + 5^262 - 40^6^ 



15 2 10 2 , 

+ 7377 + + -0262 - 0262 



2252 53 225 52 5 
2 

18 2,2 1 , 549 

< 0969 0262 

4^2 5 500 

< 0. 



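Therefore, $f_3(a_1, a_2, b_2) \neq 0$ and $a_2 = b_2$, just as needed. □

Lemma 8.

(1) $a_1^2 \le \frac{1}{2}$,

(2) $0 \le a_1 a_2 \le \frac{1}{5}$, and

(3) $0 \le a_1 b_2 \le \frac{1}{5}$.

Proof. (1) Let $A_i = 1 + a_1 a_i$ for $i = 1, \dots, 4$. Then $\sum_{i=1}^{4} A_i = 4$, $A_1 \ge A_2 \ge 1$, $A_3 \ge A_4 > 0$ and

\[ \frac{2}{A_1} + \frac{1}{A_2} + \frac{1}{A_3} + \frac{1}{A_4} = 5. \]

Since

\[ \frac{1}{A_3} + \frac{1}{A_4} \ge \frac{4}{A_3 + A_4} = \frac{4}{4 - A_1 - A_2}, \]

we have

\[ 5 = \frac{2}{A_1} + \frac{1}{A_2} + \frac{1}{A_3} + \frac{1}{A_4} \ge \frac{2}{A_1} + \frac{1}{A_2} + \frac{4}{4 - A_1 - A_2}. \tag{21} \]

Let

\[ g(A_2) = \frac{1}{A_2} + \frac{4}{4 - A_1 - A_2}, \]

regarded as a function of $A_2$ with $A_1$ fixed. Then

\[ \frac{\partial g(A_2)}{\partial A_2} = -\frac{1}{A_2^2} + \frac{4}{(4 - A_1 - A_2)^2}. \]

Note that $A_1 \ge A_2 \ge 1$, thus $4 - A_1 - A_2 \le 2$ and $\frac{\partial g(A_2)}{\partial A_2} \ge 0$. Therefore $g(A_2) \ge g(1)$ for $A_2 \ge 1$, that is,

\[ \frac{1}{A_2} + \frac{4}{4 - A_1 - A_2} \ge 1 + \frac{4}{3 - A_1}. \]

Hence by inequality (21),

\[ 5 \ge \frac{2}{A_1} + \frac{1}{A_2} + \frac{4}{4 - A_1 - A_2} \ge \frac{2}{A_1} + 1 + \frac{4}{3 - A_1}. \]

We get $2A_1^2 - 5A_1 + 3 \le 0$, i.e. $1 \le A_1 \le \frac{3}{2}$. Thus $a_1^2 \le \frac{1}{2}$.

(2) Assume $A_2 = 1 + a_1 a_2 > \frac{6}{5}$. Then $g(A_2) \ge g(\frac{6}{5})$. That is,

\[ 5 \ge \frac{2}{A_1} + \frac{1}{A_2} + \frac{4}{4 - A_1 - A_2} \ge \frac{2}{A_1} + \frac{5}{6} + \frac{4}{\frac{14}{5} - A_1}. \]

The solution set for $A_1$ is $(-\infty, 0) \cup [\frac{28}{25}, \frac{6}{5}] \cup (\frac{14}{5}, \infty)$. Note that $A_1 > 0$ and $A_1 = 1 + a_1^2 \le \frac{3}{2}$, so we get $\frac{28}{25} \le A_1 \le \frac{6}{5}$, which contradicts $A_1 \ge A_2$. Thus $A_2 \le \frac{6}{5}$ and $0 \le a_1 a_2 \le \frac{1}{5}$.

(3) This result follows by letting $A_1 = 1 + a_1^2$ and $A_i = 1 + a_1 b_i$ for $i \ge 2$. The proofs of parts (1) and (2) remain valid. □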
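
The endpoints that appear in this proof can be confirmed with exact arithmetic. The sketch below evaluates the right-hand side of (21) at the boundary values $A_1 = 1$, $A_1 = \frac{3}{2}$ (with $A_2 = 1$) and $A_1 = \frac{28}{25}$, $A_1 = \frac{6}{5}$ (with $A_2 = \frac{6}{5}$); the function name `lower_bound` is our own.

```python
from fractions import Fraction

def lower_bound(A1, A2):
    """Right-hand side of (21): 2/A1 + 1/A2 + 4/(4 - A1 - A2)."""
    return 2 / A1 + 1 / A2 + 4 / (4 - A1 - A2)

# Part (1): with A2 = 1 the bound equals 5 exactly at A1 = 1 and A1 = 3/2
part1 = [lower_bound(Fraction(1), Fraction(1)),
         lower_bound(Fraction(3, 2), Fraction(1))]

# Part (2): with A2 = 6/5 the bound equals 5 exactly at A1 = 28/25 and 6/5
part2 = [lower_bound(Fraction(28, 25), Fraction(6, 5)),
         lower_bound(Fraction(6, 5), Fraction(6, 5))]

print("part (1) endpoints:", part1)
print("part (2) endpoints:", part2)
```

Between the endpoints the bound stays below 5 and outside them it exceeds 5, which is what restricts $A_1$ to the stated intervals.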



Lemma 9. $a_i = b_i$ for $i = 3, 4$.

Proof. Let $A_i = 1 + a_i b_1$ for $i = 1, \dots, 4$. Then

\[ \sum_{i=1}^{4} A_i = 4 \quad \text{and} \quad \frac{2}{A_1} + \frac{1}{A_2} + \frac{1}{A_3} + \frac{1}{A_4} = 5. \]

By the above two equations, since $A_3 \ge A_4$, we can derive explicit expressions for $A_3, A_4$ in the variables $A_1, A_2$, say $A_3 = h_1(A_1, A_2)$ and $A_4 = h_2(A_1, A_2)$. If we let $B_i = 1 + a_1 b_i$, we can get $B_3 = h_1(B_1, B_2)$ and $B_4 = h_2(B_1, B_2)$ in a similar manner. Note that $A_1 = B_1$ and, by Lemma 7, $A_2 = 1 + a_2 b_1 = 1 + b_2 a_1 = B_2$, so we deduce that $A_i = B_i$ for $i = 3, 4$. Since $a_1 = b_1 > 0$, $a_i = b_i$ for $i = 3, 4$. □
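
The functions $h_1, h_2$ are not written out in the text. One way to obtain them, sketched below as our own derivation, is to note that the two equations fix the sum $A_3 + A_4$ and the reciprocal sum $\frac{1}{A_3} + \frac{1}{A_4}$, hence also the product $A_3 A_4$, so $A_3, A_4$ are the roots of a quadratic. The function name `remaining_entries` is hypothetical.

```python
from fractions import Fraction
import math

def remaining_entries(A1, A2):
    """Solve A3 + A4 = 4 - A1 - A2 and 2/A1 + 1/A2 + 1/A3 + 1/A4 = 5.

    Returns (A3, A4) with A3 >= A4 as floats. Raises ValueError from
    math.sqrt if the pair (A1, A2) admits no real solution.
    """
    total = 4 - A1 - A2                  # A3 + A4
    recip = 5 - 2 / A1 - 1 / A2          # 1/A3 + 1/A4
    prod = total / recip                 # A3 * A4
    disc = total * total - 4 * prod      # discriminant of z^2 - total*z + prod
    root = math.sqrt(disc)
    return (float(total) + root) / 2, (float(total) - root) / 2

# Values of (A1, A2) taken from two of the candidate matrices in Section 3
print(remaining_entries(Fraction(6, 5), Fraction(6, 5)))
print(remaining_entries(Fraction(16, 15), Fraction(16, 15)))
```

Because the same formulas apply to $(B_1, B_2)$, equal inputs force equal outputs, which is the point of the lemma.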



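By Lemmas 7 and 9 we have $a_i = b_i$ for all $i$. Hence $P$ can be expressed as

\[ P = \begin{pmatrix} 1 + a_1^2 & 1 + a_2 a_1 & 1 + a_3 a_1 & 1 + a_4 a_1 \\ 1 + a_1 a_2 & 1 + a_2^2 & 1 + a_3 a_2 & 1 + a_4 a_2 \\ 1 + a_1 a_3 & 1 + a_2 a_3 & 1 + a_3^2 & 1 + a_4 a_3 \\ 1 + a_1 a_4 & 1 + a_2 a_4 & 1 + a_3 a_4 & 1 + a_4^2 \end{pmatrix} \]

where

\[ \sum_{i=1}^{4} a_i = 0. \tag{22} \]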



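By Corollary 3 we have the following system of equations

\[ \begin{aligned} \frac{2}{1 + a_1^2} + \frac{1}{1 + a_2 a_1} + \frac{1}{1 + a_3 a_1} + \frac{1}{1 + a_4 a_1} &= 5, \\ \frac{1}{1 + a_1 a_2} + \frac{2}{1 + a_2^2} + \frac{1}{1 + a_3 a_2} + \frac{1}{1 + a_4 a_2} &= 5, \\ \frac{1}{1 + a_1 a_3} + \frac{1}{1 + a_2 a_3} + \frac{2}{1 + a_3^2} + \frac{1}{1 + a_4 a_3} &= 5, \\ \frac{1}{1 + a_1 a_4} + \frac{1}{1 + a_2 a_4} + \frac{1}{1 + a_3 a_4} + \frac{2}{1 + a_4^2} &= 5. \end{aligned} \tag{23} \]

With (22) and (23), we claim that

Lemma 10. $a_i = a_j$ if $a_i a_j > 0$.

Proof. Let

\[ F(x) = \frac{1}{1 + a_1 x} + \frac{1}{1 + a_2 x} + \frac{1}{1 + a_3 x} + \frac{1}{1 + a_4 x} + \frac{1}{1 + x^2} - 5. \]

Normalizing $F(x)$ yields a polynomial (the numerator) of degree 6 in $x$ whose constant term is 0 and whose coefficient of the term $x$ is $-\sum_{i=1}^{4} a_i = 0$. So $a_1, a_2, a_3, a_4, 0, 0$ are all the zeros of $F(x)$. Suppose there exist consecutive $i, j$ such that $a_i > a_j > 0$ (or $a_j < a_i < 0$ respectively). Then $F(x)$ is continuous in the interval $(-\frac{1}{a_j}, -\frac{1}{a_i})$. Note that in the first case

\[ \lim_{x \to (-1/a_j)^{+}} F(x) = \infty \quad \text{and} \quad \lim_{x \to (-1/a_i)^{-}} F(x) = -\infty, \]

and the two signs are exchanged in the second case. There must be a zero lying in $(-\frac{1}{a_j}, -\frac{1}{a_i})$, say $x_0$. Then $x_0 < -\frac{1}{a_i}$ (or $x_0 > -\frac{1}{a_j}$ respectively), i.e. $1 + a_i x_0 < 0$ (or $1 + a_j x_0 < 0$ respectively). Since $x_0 \neq 0$, $x_0$ must be one of the $a_k$, $k = 1, \dots, 4$. Thus $1 + a_i x_0$ (or $1 + a_j x_0$, respectively) is an entry in the matrix $P$, contradicting the fact that each entry of $P$ is positive. Therefore if $i, j$ are consecutive and $a_i a_j > 0$, we must have $a_i = a_j$. Hence $a_i a_j > 0$ implies $a_i = a_j$ for any $i, j$. □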
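
The claim that the $a_k$ and a double zero at the origin are zeros of $F$ can be illustrated numerically. The sketch below uses the sign pattern $(+, +, -, -)$ with $a_1^2 = \frac{1}{5}$, one of the solutions of (23); the finite-difference step `h` and the variable names are our own choices.

```python
import math

def F(x, a):
    """F(x) = sum_i 1/(1 + a_i x) + 1/(1 + x^2) - 5 from the proof."""
    return sum(1 / (1 + ai * x) for ai in a) + 1 / (1 + x * x) - 5

c = 1 / math.sqrt(5)
a = [c, c, -c, -c]          # satisfies (22): the entries sum to zero

# F should vanish at each a_k and at 0
values = [F(x, a) for x in a + [0.0]]

# The zero at 0 is double: the slope there is -(a_1 + ... + a_4) = 0
h = 1e-6
slope_at_zero = (F(h, a) - F(-h, a)) / (2 * h)

print("F at a_1..a_4 and 0:", values)
print("approximate F'(0):", slope_at_zero)
```

For a vector with two distinct positive entries, the same function has a pole between them, which is where the proof locates the forbidden extra zero.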



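With Lemma 10 it is handy to solve the system (23). Under Assumption 6 there are only 4 possible patterns of signs for $(a_1, a_2, a_3, a_4)$. If the signs are $(+, +, +, -)$, then $a_1 = a_2 = a_3 = -\frac{1}{3} a_4$. Substituting this into any equation in (23) yields $a_1 = a_2 = a_3 = \frac{1}{\sqrt{15}}$ and $a_4 = -\frac{3}{\sqrt{15}}$. The matrix would be

\[ P_1 = \begin{pmatrix} \frac{16}{15} & \frac{16}{15} & \frac{16}{15} & \frac{4}{5} \\ \frac{16}{15} & \frac{16}{15} & \frac{16}{15} & \frac{4}{5} \\ \frac{16}{15} & \frac{16}{15} & \frac{16}{15} & \frac{4}{5} \\ \frac{4}{5} & \frac{4}{5} & \frac{4}{5} & \frac{8}{5} \end{pmatrix}. \]

For the case when the signs are $(+, +, -, -)$, we get $a_1 = \frac{1}{\sqrt{5}}$ and the matrix would be

\[ P_2 = \begin{pmatrix} \frac{6}{5} & \frac{6}{5} & \frac{4}{5} & \frac{4}{5} \\ \frac{6}{5} & \frac{6}{5} & \frac{4}{5} & \frac{4}{5} \\ \frac{4}{5} & \frac{4}{5} & \frac{6}{5} & \frac{6}{5} \\ \frac{4}{5} & \frac{4}{5} & \frac{6}{5} & \frac{6}{5} \end{pmatrix}. \]

When the signs are $(+, +, 0, -)$, $a_1 = \frac{1}{2\sqrt{2}}$ and the matrix would be

\[ P_3 = \begin{pmatrix} \frac{9}{8} & \frac{9}{8} & 1 & \frac{3}{4} \\ \frac{9}{8} & \frac{9}{8} & 1 & \frac{3}{4} \\ 1 & 1 & 1 & 1 \\ \frac{3}{4} & \frac{3}{4} & 1 & \frac{3}{2} \end{pmatrix}. \]

And when the signs are $(+, 0, 0, -)$, $a_1 = \frac{1}{\sqrt{3}}$ and the matrix would be

\[ P_4 = \begin{pmatrix} \frac{4}{3} & 1 & 1 & \frac{2}{3} \\ 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ \frac{2}{3} & 1 & 1 & \frac{4}{3} \end{pmatrix}. \]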

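The matrices attaining a local maximum of the likelihood function must be among the matrices above. Comparing their likelihood values, we conclude that the matrix $P_2$ attains the global maximum. Finally, multiplying the matrix $P_2$ by $\frac{1}{16}$ yields

\[ P = \frac{1}{40} \begin{pmatrix} 3 & 3 & 2 & 2 \\ 3 & 3 & 2 & 2 \\ 2 & 2 & 3 & 3 \\ 2 & 2 & 3 & 3 \end{pmatrix}. \]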
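
The final comparison can be reproduced with exact rational arithmetic. The sketch below builds the four candidates from the values $a_1^2 = \frac{1}{15}, \frac{1}{5}, \frac{1}{8}, \frac{1}{3}$ and the corresponding sign patterns, checks that each satisfies the system (23), and picks the one with the largest likelihood (3). The helper names (`candidate`, `satisfies_system`) and the encoding of each pattern as an integer multiplier vector are our own.

```python
from fractions import Fraction

def candidate(t, m):
    """Matrix with entries 1 + a_i a_j, where a = sqrt(t) * m."""
    return [[1 + t * mi * mj for mj in m] for mi in m]

def likelihood(P):
    """L(P) = prod_i p_ii^2 * prod_{i != j} p_ij, as in (3)."""
    value = Fraction(1)
    for i, row in enumerate(P):
        for j, x in enumerate(row):
            value *= x ** (2 if i == j else 1)
    return value

def satisfies_system(P):
    """Each row of (23): sum_j 1/p_ij + 1/p_ii = 5."""
    return all(sum(1 / x for x in row) + 1 / row[i] == 5
               for i, row in enumerate(P))

candidates = {
    "P1": candidate(Fraction(1, 15), (1, 1, 1, -3)),   # signs (+,+,+,-)
    "P2": candidate(Fraction(1, 5), (1, 1, -1, -1)),   # signs (+,+,-,-)
    "P3": candidate(Fraction(1, 8), (1, 1, 0, -2)),    # signs (+,+,0,-)
    "P4": candidate(Fraction(1, 3), (1, 0, 0, -1)),    # signs (+,0,0,-)
}

best = max(candidates, key=lambda name: likelihood(candidates[name]))
pattern = ([3, 3, 2, 2], [3, 3, 2, 2], [2, 2, 3, 3], [2, 2, 3, 3])
conjectured = [[Fraction(v, 40) for v in row] for row in pattern]

for name, P in candidates.items():
    print(name, satisfies_system(P), float(likelihood(P)))
print("largest likelihood:", best)
```

Dividing the entries of the best candidate by 16 recovers the matrix stated in the conjecture.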



4. Some comments on general cases and discussion 

A natural question is why the Gröbner basis technique [6], the most powerful tool for solving systems of polynomial equations, does not work efficiently for this particular problem. We get a polynomial $(a_2 - b_2) f_3(a_1, a_2, b_2)$ in the proof of Lemma 7. Our approach is to show that the factor $f_3(a_1, a_2, b_2)$, a trivariate polynomial with 17 terms, is nonzero by applying the bounds from Lemma 8, so that we can derive the simplest equation $a_2 - b_2 = 0$. In the proof we used the fact that we are looking only for real solutions. However, a Gröbner basis encodes both real and complex solutions. It is possible that $f_3(a_1, a_2, b_2)$ is zero for some complex solutions. The locus of all solutions may be much more complicated than that of the real solutions, hence the Gröbner basis is much more time-consuming to compute.

The limitation of our work is that the arguments in the last section only work for $n = 4$. For $n > 4$ we still lack an efficient method. However, our methods can still be applied to more general likelihood functions. In the following, we illustrate this with a few examples.

Conjecture 11. For given $0 < t < s$ where $t, s$ are two integers, among the set of all non-negative $4 \times 4$ matrices whose rank is at most 2 and whose entries sum to 1, the matrix

\[ P = \frac{1}{4s + 12t} \begin{pmatrix} \frac{s+t}{2} & \frac{s+t}{2} & t & t \\ \frac{s+t}{2} & \frac{s+t}{2} & t & t \\ t & t & \frac{s+t}{2} & \frac{s+t}{2} \\ t & t & \frac{s+t}{2} & \frac{s+t}{2} \end{pmatrix} \]

is a global maximum for the following likelihood function

\[ L(P) = \prod_{i=1}^{4} p_{ii}^{s} \times \prod_{i \neq j} p_{ij}^{t}. \tag{24} \]



The results in Section 2 remain good for this likelihood function. The equation (10) becomes

\[ \sum_{i=1}^{n} \frac{a_i}{1 + a_i b_j} + \frac{s - t}{t} \cdot \frac{a_j}{1 + a_j b_j} = 0. \]



But the bounds in Lemma 8 involve the fraction $\frac{s}{t}$ and become complicated. A similar equation to (20) can be derived, but the nonzero factor is difficult to claim. Hopefully we may also prove $a_2 = b_2$. Then $a_3 = b_3$ and $a_4 = b_4$ can be derived in a similar manner to Lemma 9, and the same holds for Lemma 10. Finally we can find 4 local extrema and need only compare them to obtain the global maximum. In the case when the signs of $(a_1, a_2, a_3, a_4)$ are $(+, +, +, -)$, we have the equation

\[ a_1^2 \big( (3s + 9t) a_1^2 - (s - t) \big) = 0. \]

Thus $a_1^2 = \frac{s - t}{3s + 9t}$, and the matrix would be

\[ P_1 = \frac{1}{3s + 9t} \begin{pmatrix} 4s + 8t & 4s + 8t & 4s + 8t & 12t \\ 4s + 8t & 4s + 8t & 4s + 8t & 12t \\ 4s + 8t & 4s + 8t & 4s + 8t & 12t \\ 12t & 12t & 12t & 12s \end{pmatrix}. \]

In the case when the signs are $(+, +, -, -)$, we get $a_1^2 = \frac{s - t}{s + 3t}$ and the matrix would be

\[ P_2 = \frac{1}{s + 3t} \begin{pmatrix} 2s + 2t & 2s + 2t & 4t & 4t \\ 2s + 2t & 2s + 2t & 4t & 4t \\ 4t & 4t & 2s + 2t & 2s + 2t \\ 4t & 4t & 2s + 2t & 2s + 2t \end{pmatrix}. \tag{25} \]



One can prove that $L(P_1) < L(P_2)$ by some calculus technique, for example, taking the partial derivative of $L(P_1)/L(P_2)$ with respect to $s$. By similar approaches one can also show that $L(P_3) < L(P_2)$ and $L(P_4) < L(P_2)$, where $P_3, P_4$ are the corresponding matrices for the cases when the signs are $(+, +, 0, -)$ and $(+, 0, 0, -)$ respectively. Thus the matrix in (25) is a global maximum.
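
The inequality $L(P_1) < L(P_2)$ can be spot-checked numerically for particular exponents. The sketch below evaluates the logarithm of (24) at the two matrices, using the entry counts read off from their block structure; it is a numerical illustration for a few hand-picked pairs $(s, t)$, not a proof, and the function name `log_likelihoods` is our own.

```python
import math

def log_likelihoods(s, t):
    """Return (ln L(P1), ln L(P2)) for the likelihood (24) with 0 < t < s."""
    d1 = 3 * s + 9 * t
    d2 = s + 3 * t
    # P1: three diagonal entries (4s+8t)/d1 and one diagonal entry 12s/d1;
    # six off-diagonal entries (4s+8t)/d1 and six off-diagonal entries 12t/d1
    l1 = (s * (3 * math.log((4 * s + 8 * t) / d1) + math.log(12 * s / d1))
          + t * (6 * math.log((4 * s + 8 * t) / d1)
                 + 6 * math.log(12 * t / d1)))
    # P2: four diagonal entries (2s+2t)/d2;
    # four off-diagonal entries (2s+2t)/d2 and eight off-diagonal entries 4t/d2
    l2 = (s * 4 * math.log((2 * s + 2 * t) / d2)
          + t * (4 * math.log((2 * s + 2 * t) / d2)
                 + 8 * math.log(4 * t / d2)))
    return l1, l2

for s, t in [(2, 1), (4, 2), (11, 10), (100, 1)]:
    l1, l2 = log_likelihoods(s, t)
    print(s, t, l1 < l2)
```

Both matrices are scaled so that their entries sum to 16, so the comparison is between points of the same feasible set.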

More generally, let $(u)_{l_1 \times l_2}$ be a block matrix with every entry being $u$, where $l_1, l_2 \in \mathbb{N}$ and $u > 0$.

Conjecture 12. Let $n > 2$ and $0 < t < s$. Then the matrix



1 



ns + (n — Ijnt 



LfJxLfJ 



is a global maximum for the likelihood function $L(P)$ in (24).

Conjecture 13. Let $n > 2$ and $0 < s < t$. Then the matrix



( 



2t 



P 



n'^{s+t) 



•n?{s+t) 



\ 



1 



1 



2t 



1 



1 

7:^ 



1 



1 

— ^ 



2s 



n'^{s+t) 



v?{s+t) 



is a global maximum for the likelihood function $L(P)$ in (24).